PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GSMUA_Achr7P00210_001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Zingiberales; Musaceae; Musa
Family HD-ZIP
Protein Properties Length: 679aa    MW: 75177.2 Da    PI: 5.6311
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GSMUA_Achr7P00210_001genomeCIRADView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox59.36.3e-193893156
                           TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
               Homeobox  1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                           r++ +++t++q+++Le++F+++++p+ ++r eL ++lgL+  qVk+WFqN+R++ k
  GSMUA_Achr7P00210_001 38 RKRHHRHTPDQIQRLEAFFKECPHPDDRQRNELGRELGLEPLQVKFWFQNKRTQIK 93
                           567789**********************************************9988 PP

2START147.31.4e-462044204206
                            HHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
                  START   4 eeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                            + aa+el++  +++ep W+  +    e +n++e+ + f+ + +      ++ea+r+++vv m + +lv +l+d++ +W+  +     +
  GSMUA_Achr7P00210_001 204 QVAAEELLAISQLGEPIWTWGYdgfrETLNQEEYARAFPGGLGpkmegLRTEASRETAVVRMHTGRLVDILMDVS-KWSVFFSsivtR 290
                            6678899999999*****99999999************666559*******************************.***999999999 PP

                            EEEEEEECTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEEC CS
                  START  79 aetlevissg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepks 158
                            a+t+e i+          +  +m aelq+++plvp R+++f+Ry+++  +g wv+vdvS +++ +p      +R +++pSg++i++++
  GSMUA_Achr7P00210_001 291 ATTVEIITIIitilfigYVHSQMAAELQVPTPLVPlRESLFLRYCKR-HDGLWVVVDVSMEVRLNPM-----ARYRRRPSGCVIQQLP 372
                            9999999998888999455568999**********************.****************985.....79999*********** PP

                            TCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                  START 159 nghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                            n +skv+w+ehv+++++ +h +++s+v+s+la+gak+wv+t+ rqce+
  GSMUA_Achr7P00210_001 373 NDYSKVVWIEHVEVDDSGVHDMFKSMVNSCLAFGAKRWVSTMTRQCER 420
                            **********************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.2E-202489IPR009057Homeodomain-like
SuperFamilySSF466898.77E-192595IPR009057Homeodomain-like
PROSITE profilePS5007117.1033595IPR001356Homeobox domain
SMARTSM003894.6E-193699IPR001356Homeobox domain
CDDcd000868.01E-193896No hitNo description
PfamPF000461.2E-163893IPR001356Homeobox domain
PROSITE profilePS5084835.409192423IPR002913START domain
SuperFamilySSF559614.53E-24192421No hitNo description
CDDcd088752.29E-82200419No hitNo description
SMARTSM002345.4E-29201420IPR002913START domain
PfamPF018528.0E-38205420IPR002913START domain
Gene3DG3DSA:3.30.530.201.4E-4323419IPR023393START-like domain
SuperFamilySSF559617.28E-13438630No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 679 aa     Download sequence    Send to blast
MDQLIPDVDE DFANTPGSEP TEGASEDELQ DHGSRGKRKR HHRHTPDQIQ RLEAFFKECP  60
HPDDRQRNEL GRELGLEPLQ VKFWFQNKRT QIKNQSEREE NQRLKDENDR LLSENLRLKE  120
ALTGACCPNC GGPPHLGEMS IDEQELQAEN ARLQDEIGRI SAIVAKYAEP DMTLNDSSRG  180
GEVVFRPHDR RTEMGRRTAL ALAQVAAEEL LAISQLGEPI WTWGYDGFRE TLNQEEYARA  240
FPGGLGPKME GLRTEASRET AVVRMHTGRL VDILMDVSKW SVFFSSIVTR ATTVEIITII  300
ITILFIGYVH SQMAAELQVP TPLVPLRESL FLRYCKRHDG LWVVVDVSME VRLNPMARYR  360
RRPSGCVIQQ LPNDYSKVVW IEHVEVDDSG VHDMFKSMVN SCLAFGAKRW VSTMTRQCER  420
LACLLATNSS SADIAANPDG TRNILKLAER MVVSFCSGVS GSVAHHWTNI SGNVADHVRI  480
RSRNNDGDPG RPPGVVLNAS TSIWLPITPK RVFELLRSES LRNEWDILLK GGMIHEAACI  540
ANGQQGGNCV SIYTVGLPEE ETTNMMILQE SCSDPTCSYV IYAPVDAAAI AAVLSGGDPN  600
HVALLPSGFA ILPDRPSSAV DHDGETDLGG GSLLTVAFQI LVFSGPGSKI SRGFIATVDG  660
LLSCTCDRIR TILSGNSVR
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_009408807.10.0PREDICTED: homeobox-leucine zipper protein ROC2-like
SwissprotQ0J9X20.0ROC2_ORYSJ; Homeobox-leucine zipper protein ROC2
SwissprotQ93V990.0PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBLM0TDG10.0M0TDG1_MUSAM; Uncharacterized protein
STRINGGSMUA_Achr7P00210_0010.0(Musa acuminata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP79938147
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G04890.10.0protodermal factor 2